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Abstract Sulfolobus synthesizes large amounts of small 
chromatin proteins Cren7 and Sul7d. The two proteins 
share overall structural similarity, but differ distinctly in the 
DNA-binding region between B3- and 64-strands. While 
Sul7d possesses a hinge of two amino acid residues, Cren7 
contains a flexible seven-residue loop (loop 63-64) in the 
region. Here, we report the role of loop 83-64 in the inter- 
action of Cren7 with duplex DNA. We show that all resi- 
dues with a large side chain on the loop, i.e., Pro30, Lys31, 
Arg33 and Lys34, contributed significantly to the binding 
of Cren7 to DNA. The three basic amino acids affected 
the ability of Cren7 to constrain negative DNA supercoils 


The atomic coordinates and structure factors for the GR- 
GTGATCAC and the Cren7-GTGATCAC complexes have been 
deposited in the Protein Data Bank (http://www.pdb.org) under 
accession codes 4R55 and 4R56, respectively. 
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in a residue number-dependent manner. The crystal struc- 
ture of a complex between a mutant Cren7 protein (GR) 
with loop 63-64 replaced by two residues (Gly and Arg) 
to mimic the hinge at the corresponding position in Sul7d 
and an 8-bp dsDNA has been determined. Structural com- 
parison between the GR-DNA and Cren7—DNA complexes 
shows that GR resembles Sul7d more than Cren7 in DNA- 
binding size and in the effect on the width of the major 
groove of DNA and the pattern of DNA bending. However, 
GR induces smaller DNA curvature than Sul7d. Our results 
suggest that Cren7 and Sul7d package chromosomal DNA 
in a slightly different fashion, presumably permitting dif- 
ferent chromosomal accessibility by proteins functioning in 
DNA transactions. 


Keywords Chromatin protein - Cren7 - Loop 63-64 - 
Protein—DNA complex - Sulfolobus solfataricus - Archaea 


Introduction 


Genomic DNA must be compacted to be accommodated 
by the cell while kept accessible to molecular machiner- 
ies for DNA transactions. Chromatin proteins are respon- 
sible for the compaction and organization of the chromo- 
somal DNA (Luijsterburg et al. 2008). Eukarya encode 
histones, which wrap DNA around the core octamer to 
form a nucleosome (Luger et al. 1997). In bacteria, the 
chromosomal DNA is compacted into a nucleoid by 
a number of small DNA-binding proteins, such as HU, 
IHF, H-NS, Fis and Lrp (Dame 2005; Wu 2004). How- 
ever, the situation is more complex in Archaea as differ- 
ent archaeal phyla may employ different sets of DNA- 
binding proteins (Zhang et al. 2010). Alba is a unique 
nucleic acid-binding protein highly conserved among the 
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domain Archaea (Bell et al. 2002). However, the protein 
appears to bind both DNA and RNA (Guo et al. 2003; 
Marsh et al. 2005). Therefore, the physiological roles 
of Alba remain to be understood. Euryarchaea produce 
archaeal histones capable of packaging DNA in a simi- 
lar manner to their eukaryal counterparts (Reeve 2003; 
Sandman and Reeve 2005). However, archaeal histones 
are absent in crenarchaea with the exception of some spe- 
cies (Cubonova et al. 2005). Several small DNA-binding 
proteins, including Sul7d, CCl and Cren7, have been 
isolated from crenarchaea (Grote et al. 1986; Guo et al. 
2008; Luo et al. 2007). While Sul7d and CC1 are phy- 
logenetically restricted to some crenarchaeal species, 
Cren7 is highly conserved in the phylum (Zhang et al. 
2012). It is suggested that crenarchaea may have evolved 
a Cren7-based DNA packaging strategy. 

Cren7, a small and basic chromatin protein, preferen- 
tially binds dsDNA over ssDNA, protects DNA duplex 
from thermal denaturation and constrains negative DNA 
supercoils in vitro (Guo et al. 2008). Sul7d, a 7-kDa chro- 
matin protein from Sulfolobus, resembles Cren7 in bio- 
chemical properties (Mai et al. 1998; Napoli et al. 2002). 
Both Cren7 and Sul7d compact DNA by inducing rigid 
bends with similar efficiencies (Driessen et al. 2013). In S. 
solfataricus P2 cells, Cren7 and Sso7d, a member of the 
Sul7d family, are both synthesized in abundance, amount- 
ing to ~1 and ~5 %, respectively, of the total cellular pro- 
teins (Guo et al. 2008). Therefore, it appears that the two 
proteins are both functional and possibly serve non-redun- 
dant roles in vivo. 
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Crystal structures of Cren7 and Sul7d alone and in com- 
plex with duplex DNA have been determined (Gao et al. 
1998; Robinson et al. 1998; Zhang et al. 2010). Despite 
the low similarity at the amino acid sequence level, the two 
proteins share a similar tertiary structure. However, com- 
parison of the DNA-binding surfaces of Cren7 and Sac7d, 
a member of the Sul7d family from S. acidocaldarius, 
reveals a remarkable difference in the loop region between 
63 and B4 strands (loop 83-64) (Zhang et al. 2010). A 
structure-based sequence alignment shows that Cren7 pos- 
sesses a long (seven residues) and highly flexible loop B3- 
B4, whereas Sac7d contains a small hinge (two residues) at 
the corresponding position (Fig. 1). Loop 83-64 of Cren7 
undergoes a significant conformational change upon bind- 
ing of the protein to DNA and covers about two base pairs 
in the crystals of the Cren7-DNA complex. The loop region 
is conserved among Cren7 homologs, and most of the resi- 
dues on the loop interact directly with the DNA strands. 

To elucidate the role of loop 63-84 in chromosomal 
organization by Cren7 and learn more about the poten- 
tial functional differences between Cren7 and Sul7d, it is 
essential to understand the effect of the loop on the inter- 
action of Cren7 with DNA. In the present study, we intro- 
duced point mutations and deletions into the loop region 
and determined the effects of the mutations on DNA bind- 
ing and supercoil constraining by Cren7. In addition, we 
solved the crystal structure of a complex between a Cren7 
deletion variant (GR), which mimics Sac7d in the loop 
region, with a duplex DNA fragment. Our results suggest 
that Cren7 and Sul7d may allow chromosomal DNA to be 
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Fig. 1 Sequence alignment of Cren7 homologs and Sac7d. a Struc- 
ture-based sequence alignment of Cren7 (PDB code: 3LWH) and 
Sac7d (PDB code: 1AZQ). The secondary structures of both proteins 
are indicated. b Sequence alignment of selected Cren7 homologs 
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from representative crenarchaeal genera. The secondary structures of 
Cren7 are indicated. The loop B3-p4 regions of Cren7 in both panels 
are marked 
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packaged in a slightly different fashion and, therefore, to 
exhibit different accessibility. 


Materials and methods 
Preparation of mutant proteins 


Point mutations were introduced into Cren7 by site-directed 
mutagenesis using the Fast Mutagenesis System (TransGen 
Biotech, China) with pET30a—Cren7, an expression plas- 
mid for Cren7 (Guo et al. 2008), as the template. The dele- 
tion mutants of Cren7 were prepared by overlapping PCR 
and subsequently inserting the resulting PCR fragment 
into pET30a (Novagen, USA). Primers used in the above 
assays are listed in supplementary table S1. All the muta- 
tions were confirmed by DNA sequencing. The mutant 
constructs were transformed into E. coli BL21 (DE3) cells, 
and recombinant proteins were overproduced and purified 
as described previously (Zhang et al. 2010). The purified 
proteins were lyophilized, resuspended in 20 mM Tris-Cl, 
pH 6.8, and 1 mM EDTA, and stored at —80 °C. 


Surface plasmon resonance assays (SPR) 


SPR experiments were carried out at 25 °C using the BIAc- 
ore 3000 instrument (BIAcore AB, Uppsala, Sweden). The 
running buffer contained 50 mM Tris—Cl, pH 7.5, 100 mM 
NaCl and 0.005 % (v/v) Tween 20. Two complementary 
oligonucleotides (supplementary table S1), including one 
with a biotin label, were immobilized on the SA sensor 
chip to 91-97 response units (RU). A blank flow cell was 
used as the reference to correct for instrumental and con- 
centration effects. Wild-type or mutant Cren7 at a concen- 
tration within a range spanning the Kp value for the inter- 
action of the protein with the DNA was injected over the 
DNA surface for 2 min at a flow rate of 30 pl/min. After the 
dissociation phase (2—4 min), bound protein was removed 
by a 30-s wash with 0.01 % SDS, followed by a 60-s buffer 
injection. Measurement at the lowest protein concentration 
in each SPR experiment was repeated once. Equilibrium 
and kinetic constants were calculated by a global fit to 1:1 
Langmuir binding model (BIA evaluation 4.1 software). 


Nick-closure assays 


Nick-closure assays were carried out as described previ- 
ously with some modifications (Guo et al. 2008). Plasmid 
pBR322 containing a single nick per circular molecule was 
prepared by digestion with the nicking enzyme Nb.Bpul0I 
(Fermentas). The nicked plasmid (0.2 ug) was mixed with 
various amounts of wild-type or mutant Cren7 in 40 mM 
Tris—Cl, pH 7.8, 10 mM MgCl,, 10 mM DTT, 0.5 mM ATP 
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and 100 ug/ml BSA, and incubated at 25 °C for 10 min. 
Ligation was then initiated by the addition of T4 DNA 
ligase (2U; Fermentas) to the mixture to a final volume of 
11 ul. After further incubation at 25 °C for 2 min, reactions 
were terminated with 5x stop buffer (250 mM EDTA and 
2.5 % SDS). Samples were deproteinized and analyzed by 
agarose gel electrophoresis in the presence or absence of 
chloroquine. Gels were stained in 0.5 ug/ml ethidium bro- 
mide and visualized under UV light. 


Size exclusion chromatography 


A 100-11 sample (~1 mM) of wild-type or mutant Cren7 
was subjected to size exclusion chromatography in 0.1 M 
NH,Ac buffer (pH 5.2) on a Superdex 75 column (GE 
healthcare) at a flow rate of 0.5 ml/min at 4 °C. Elution of 
the protein was monitored by absorbance at 280 nm. The 
void volume (v,) and total volume (v,) of the column were 
determined with blue dextran and xylene cyanol, respec- 
tively. The fractional retention (K,,,) was calculated using 
the formula Kay = (v.—v,)/(v,-V,), where v, is the peak elu- 
tion volume. A standard curve of K,,, against log M, was 
generated by using low molecular weight protein standards 
(GE healthcare). 


Crystallization, data collection and structure determination 


Cocrystallization screening was performed with GR, 
a deletion variant of Cren7, and each of the three 8-bp 
palindromic DNA sequences (supplementary table S1) 
by sitting-drop vapor diffusion. Crystals suitable for 
the X-ray diffraction were obtained only with the GR- 
d(GTGATCAC), complex. GR and a DNA duplex were 
mixed at a molar ratio of 2:1 to a final concentration of 
2 mM for the protein. The sample (1 ul) was mixed with 
the reservoir solution (1 ul) and equilibrated at 20 °C with 
0.2 M NH,Ac, 0.01 M MgAc,, 0.05 M sodium cacodylate 
trihydrate, pH 6.5, and 30 % PEG8000. Crystallization of 
Cren7—d(GTGATCAC), was performed under the condi- 
tions identical to those described previously for the Cren7— 
DNA complexes (Zhang et al. 2010). The crystals were 
mounted on nylon loops and immediately frozen in liquid 
nitrogen. The data for the complex was collected to 1.8 A 
at the Rigaku MicroMax-007HF (Rigaku, Japan) at 100 K, 
and the wavelength was 1.5418 A. The data were integrated 
and scaled with HKL2000 (Otwinowski and Minor 1997). 
The structures of Cren7—-d(GTGATCAC), and GR- 
d(GTGATCAC), were determined by molecular replace- 
ment using the program Phaser (McCoy et al. 2007) from 
the CCP4 program suite (Collaborative Computational 
Project N 1994) using the crystal structure of Cren7— 
d(GTGATCAC), (PDB code: 3LWH) as the initial model. 
The software packages Refmac (Murshudov et al. 1997) 
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Fig. 2 Kinetic characterization of the binding of wild-type and 
mutant Cren7 to DNA by SPR. Wild-type or mutant Cren7 was 
injected over immobilized DNA. The k, (recognition) is plotted 
against the ky (complex stability) on a logarithmic scale. The diago- 
nal lines (gray dotted lines) are isoaffinity lines representing the 
affinity Kp = kg/k, 


and Coot (Emsley and Cowtan 2004) were used to com- 
plete the model. Statistics for data collection and refine- 
ment are given in Table 3. The qualities of the final struc- 
tures were evaluated using PROCHECK (Laskowski et al. 
1993). DNA conformations were analyzed using the pro- 
gram X3DNA (Lu and Olson 2008). All images were pre- 
pared using Pymol (http://www.pymol.org) and ESPript 
(Gouet et al. 1999). 


Results 


Kinetic analysis of the interaction of loop 83-64 mutants 
with DNA 


Loop 63-64 of Cren7 comprises seven amino acid residues, 
i.e., Ala29, Pro30, Lys31, Gly32, Arg33, Lys34 and Gly35. 
We have previously shown that single-alanine substitution 
of Pro30, Lys31 or Arg33, each of which is located on the 
DNA-binding surface of Cren7, resulted in significant reduc- 
tion in binding affinity of the protein for dsDNA (Zhang 
et al. 2010). To learn more about the role of loop 63-64 in 
the interaction of Cren7 with DNA, we first introduced a 
single-alanine substitution at Pro30, Lys31, Arg33 or Lys34 
and double- and triple-alanine substitutions at the latter three 
basic amino acid residues, and compared the ability of these 
mutants to bind DNA and constrain negative DNA super- 
coils (supplementary figure S1, Fig. 2; Tables 1, 2). All the 
mutants showed reduced binding affinity for DNA, as com- 
pared to that of wild-type Cren7. The four single mutants, 
i.e., P30A, K31A, R33A and K34A, were similar in binding 
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Table 1 Amino acid sequences of loop 63-4 and its mutant forms 


Protein Sequence 

Cren7 28-L APKGRKG V-36 
LD 28-L A---------- G V-29 
GR 28-L ------ GR---- V-31 
GK 28-L ------ GK---- V-31 
Sac7d 26-V GK M-29 
P30A 28-L AAKGRKG V-36 
K31A 28-L APAGRKG V-36 
R33A 28-L APKGAKG V-36 
K34A 28-L APKGRAG V-36 
K31A/R33A 28-L APAGAKG V-36 
K31A/K34A 28-L APAGRAG V-36 
R33A/K34A 28-L APKGAAG V-36 
K31A/R33A/K34A 28-L APAGAAG V-36 
G32A 28-L APKARKG V-36 
G35A 28-L APKGRKA V-36 
AG32 28-L APK--RKG V-36 
AG35 28-L APKGRK-- V-36 


affinity with K, values in the 107” M range. However, the 
scattered distribution of the mutants in the k,/k, map sug- 
gests that these amino acid residues may differ in their roles 
in DNA binding by Cren7. The association rate constant (k,) 
of P30A was nearly identical to that of wild-type Cren7, but 
the dissociation rate constant (kq) of the mutant was four- 
fold as high as that of the wild-type protein. Presumably, the 
hydrophobic interactions between the side chain of Pro30 
and the bases of the DNA serve to stabilize the Cren7— 
DNA complex. On the other hand, mutation of each of the 
three basic amino acid residues was accompanied by a 10- 
to 20-fold increase in both k, and ky and, therefore, only a 
slight decrease in DNA-binding affinity. Among the three 
single mutations, substitution of Ala31 for Lys31 resulted in 
the largest decrease (~5 fold) in DNA-binding affinity, point- 
ing to the importance of Lys31 in the interaction of the pro- 
tein with the DNA. R33A showed a slight reduction (~1.3 
fold) in DNA binding, because its rates of both association 
and dissociation increased to similar extents. Notably, muta- 
tion of Lys34, which is not in contact with the DNA in the 
crystals (Zhang et al. 2010), also led to a twofold decrease 
in the binding affinity of the protein, raising the possibility 
that the residue interacts with the DNA in solution. This is 
consistent with the observation that alanine substitution of 
both Lys34 and Lys31 or Arg33 further reduced the binding 
affinity by two- or fivefold, respectively, as compared to that 
of K31A or R33A. In fact, the Kp values of the three dou- 
ble mutants (K31A/R33A, K31A/K34A and R33A/K34A) 
were similar (1.2—-1.5 uM). Moreover, the double mutants 
were clustered in the k,/k, map, exhibiting a tenfold decrease 
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Table 2 Kinetic analysis of 


eae Q Mutant Affinity Kp (M) Association rate k, (M7!s~!) Dissociation rate kg (s75) 

DNA binding by wild-type and 

mutant Cren7 by SPR Cren7-WT 1.34 x 1077 6.13 + 0.14 x 10° 8.21 + 0.34 x 1074 
LD* 2.10 x 107° ND? ND? 
GR 1.67 x 107° 2.63 + 0.04 x 10° 4.38 + 0.06 x 1077 
GK 2.78 x 107° 1.90 + 0.06 x 10° 5.29 + 0.08 x 107° 
P30A 4.73 x 1077 6.59 + 0.10 x 10° 3.12 + 0.04 x 1073 
K31A 6.46 x 1077 1.57 + 0.04 x 104 1.01 + 0.02 x 107? 
R33A 1.79 x 1077 7.75 + 0.16 x 104 1.39 + 0.02 x 107? 

a Equilibrium and kinetic rate K34A 2.36 x 1077 3.91 + 0.03 x 104 9.20 + 0.06 x 107° 

constants for this mutant were K31A/R33A 1.53 x 107° 1.58 + 0.03 x 10° 2.42 + 0.05 x 1077 

calculated using a steady-state K31A/K34A 1.46 x 10% 1.67 + 0.06 x 10° 2.45 + 0.16 x 107° 

Tepe (BIA evaluation  R33A/K34A 1.18 x 10-6 2.99 + 0.09 x 10° 3.53 +0.11 x 107° 
K31A/R33A/K34A 1.69 x 1075 1.58 + 0.05 x 107 2.67 + 0.06 x 107° 


> Not determined 


in DNA-binding affinity, as a result of a slight decrease in 
both k, and ką values, as compared to those of the wild-type 
protein. Substitution of all three basic residues with alanine 
reduced the binding affinity of Cren7 for DNA by over two 
orders of magnitude, emphasizing the critical role of the 
positively charged residues on loop 83-84 in the interaction 
of the protein with the DNA. Intriguingly, the drastic loss in 
DNA binding by K31A/R33A/K34A was due primarily to a 
40-fold decrease in the k, of the protein. A threefold reduc- 
tion was observed in the dissociation rate constant of the tri- 
ple mutant. It is worth noting that both the double and the 
triple mutants had similar dissociation rate constants. These 
results suggest that the positive surface electric potential in 
the loop B3-B4 region plays a key role in the Cren7-DNA 
interaction and contributes more to the association of Cren7 
to DNA than to the stabilization of the Cren7—-DNA complex. 


Contribution of the basic residues in loop 83-64 
to constraining of negative DNA supercoils by Cren7 


Cren7 is highly efficient in constraining DNA into nega- 
tive supercoils (Guo et al. 2008). Nick-closure assays 
were performed to determine the ability of the above loop 
63-64 mutants to constrain DNA supercoils (Fig. 3a). 
Singly nicked pBR322 was ligated in the absence or pres- 
ence of wild-type or mutant Cren7. It was noticed that the 
plasmid nick closed by T4 DNA ligase at 25 °C appeared 
to be slightly positively supercoiled rather than relaxed. 
This presumably resulted from the small difference in 
the experimental conditions (e.g., temperature, ionic 
strength) of the nick ligation assay and electrophoresis in 
agarose gel (Mai et al. 1998). As increasing amounts of 
the protein were added to the reaction and thus increasing 
amounts of negative supercoils were constrained by the 
protein, the ligated plasmid first lost positive supercoils 
and subsequently accumulated negative supercoils. The 
linking changes of the ligated pBR322 were measured by 


band counting and plotted against the protein/DNA mass 
ratios (Fig. 3b). The slope of each plot is taken as a meas- 
ure of the efficiency of the protein in constraining DNA 
supercoils. 

As shown in Fig. 3b, ~23 negative supercoils were 
constrained by Cren7 at the mass ratio of 1:1 (molecular 
ratio of ~436:1). This translates into binding of 207 to 253 
Cren7 molecules to a plasmid molecule according to the 
stoichiometry of 9-11 Cren7 molecules for a constrained 
supercoil (Zhang et al. 2010). Given a DNA-binding site 
size of ~8 bp, ~545 Cren7 molecules are required to satu- 
rate the 4361-bp pBR322 DNA molecule. Therefore, the 
binding of Cren7 to the plasmid DNA was far from satu- 
ration at the mass ratio of 1:1. This is consistent with our 
previous finding that Cren7 did not saturate the pBR322 
DNA until the mass ratio reached 8:1 (Guo et al. 2008). 
P30A constrained DNA supercoils as efficiently as wild- 
type Cren7. Since the former was fourfold weaker than the 
latter in binding to DNA, it seems that the ability of Cren7 
to constrain DNA supercoils is not directly related to the 
affinity of the protein to DNA. All of the remaining alanine 
substitution mutants of Cren7 showed significantly reduced 
ability to constrain negative DNA supercoils, as compared 
to that of the wild-type protein. There appears to be a gen- 
eral correlation between the number of basic residues on 
loop 63-64 and the supercoil-constraining ability of the 
protein variants. Cren7 containing one, two or three ala- 
nine substitutions at Lys31, Arg33 and Lys34 residues was 
approximately 2-, 4- or 16-fold, respectively, less efficient 
in constraining DNA supercoils than the wild-type protein. 
In other words, the efficiency of supercoil constraining by 
the protein decreased by approximately fourfold with the 
loss of each basic residue on loop 63-64. Notably, little 
difference in DNA supercoiling was observed between the 
three single mutants or between the two double mutants, 
although each of the two sets of the mutants differed among 
themselves in DNA-binding affinity. Therefore, we infer 
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Fig. 3 DNA nick-closure A 
assays. a Effect of mutations 
in loop 83-84 on the ability 
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from these observations that the positively charged residues 
in the loop region exert a similar impact on DNA geometry. 


The role of the glycine residues on loop B3-(4 in the 
folding of Cren7 


Among the seven residues in loop 63-64, two glycine 
residues (Gly32 and Gly35) are highly conserved in the 
Cren7 homologs (Fig. 1), suggesting their importance to 
the structure and/or function of Cren7. To understand the 
potential roles of the two residues, we performed alanine 
substitution and deletion analyses on each of the two gly- 
cine residues (Table 1). Four mutants, i.e., G32A, G35A, 
AG32 and AG35, resembled wild-type Cren7 in both 
DNA binding and supercoiling (Data not shown). This is 
not unexpected since the two glycine residues are not in 
contact with DNA in the Cren7—DNA cocrystals (Zhang 
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et al. 2010). Surprisingly, however, an unexpected protein 
peak, in addition to the typical monomeric Cren7 peak, 
was found when G32A, AG32 and AG35 were subjected 
to size exclusion chromatography (Fig. 4). Since the pro- 
tein variants eluted as monomers in the expected peak 
showed calculated molecular masses ranging from 8.86 
to 9.05 kDa, the protein species that came off the col- 
umn in the additional peak appeared to be dimers based 
on their molecular masses (18.6—21.0 kDa in size). Sub- 
stitution of Gly32 with alanine facilitated dimerization of 
the mutant, with ~15 % of the G32A molecules existing 
as dimers under our experimental conditions. Deletion of 
Gly32 increased the formation of dimers to ~30 %. G35A 
existed as monomers, while about half of the AG35 mol- 
ecules formed dimers. Therefore, the two glycine residues 
may serve to promote proper folding of Cren7, allowing 
the protein to exist as a monomer. 
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Fig. 4 Gel filtration chromatography of wild-type and mutant Cren7. 
All the protein samples were loaded at the same amount (~600 wg 
protein in 100 wl), as described in “Materials and Methods”. The 
Y-axis represents the absorbance at 280 nm. The scale for the Y-axis 
for each elution profile was adjusted to permit easy comparison of the 
elution times of the proteins. Molecular weight markers (conalbumin, 
75kD; ovalbumin, 44kD; carbonic anhydrase, 29kD; ribonuclease, 
13.7kD; aprotinin, 6.5kD) are indicated. The calculated molecular 
masses of the sample peaks are also shown 


Significance of the length of loop B3-64 


Unlike Cren7, Sac7d possesses a small hinge between B3 
and 64 strands. Sequence alignment of the two structur- 
ally homologous proteins reveals the similarity between the 
two residues (Gly32—Arg33) in the middle of loop B3-B4 
of Cren7 and the hinge region of Sac7d (Gly27—-Lys28). 
A question then arises as to whether the function of loop 
B3-B4 in Cren7 can be maintained by a much shorter hinge 
similar to that in Sac7d. To answer this question, we pre- 
pared three deletion mutant proteins of Cren7, denoted LD, 
GR and GK, respectively (Table 1). LD, constructed by 
deleting the Pro30—Lys34 region, had a short loop consist- 
ing of an alanine residue and a glycine residue. To mimic 
the hinge between B3 and £4 strands in Sac7d, GR and GK 
were made by substituting the seven-residue B3-B4 loop 
for a two-residue stretch (i.e., Gly-Arg for GR and Gly- 
Lys for GK). DNA binding by these mutants was analyzed 
by SPR (Table 2). In agreement with a previous report 
(Feng et al. 2010), the Kp of LD was ~20 uM, which was 
nearly 200-fold higher than that of wild-type Cren7 (~123 
nM). The substantial loss in the binding affinity of LD for 
DNA was almost entirely attributable to an increase in 
ka which was too high to be determined from the disso- 
ciation curves, indicating that the LD-DNA complex was 
very unstable. By comparison, the affinities of GR and GK 
for DNA were lower than the wild-type protein by only 


401 


Table 3 Data collection and refinement statistics 
GR-GTGATCAC Cren7-GTGATCAC 


Data collection 


Space group P2,212; C222, 
Cell dimensions 

a, b, c (A) 42.5, 41.8, 54.3 77.9, 77.9, 104.4 

a, P, y (°) 90.0, 90.0, 90.0 90.0, 90.0, 90.0 
Resolution (Å) 50-1.80 (1.86-1.80) 50-2.30 (2.38-2.30) 
R merge 0.05 (0.08) 0.04 (0.48) 
Ilol 40.6 (25.3) 37.6 (3.7) 
Completeness (%) 99.5 (99.0) 97.9 (96.3) 
Redundancy 6.7 (6.6) 7.2 (7.4) 

Refinement 

Resolution (A) 17-1.80 50-2.30 
No. of reflections 8922 13431 
Ryvork/Rree 0.219/0.252 0.200/0.232 
No. of atoms 

Protein 423 926 

DNA 322 644 

Water 72 116 
B-factors 

Protein 13.0 47.0 

DNA 11.5 33.2 

Water 15.6 42.1 
Rms deviations 

Bond lengths (A) 0.016 0.008 

Bond angles (°) 1.7 1.3 


~10- and ~20-fold, respectively. A slight decrease in k, 
and a modest increase in ky were observed for both mutant 
proteins. The difference between LD and GR (or GK) in 
DNA-binding affinity indicates that the positive charge in 
the loop region is crucial to the binding of Cren7 to DNA. 
Since LD binds to DNA less tightly than the triple substitu- 
tion mutant (K31A/R33A/K34A), in which the three basic 
residues on loop 63-84 were replaced with alanine, the 
size of the loop appears to contribute to the stability of the 
Cren7—DNA complex as well. This is also supported by the 
resemblance of both GR and GK to the double-substitution 
mutants (K31A/R33A, K31A/K34A and R33A/K34A) in 
DNA-binding kinetics and affinity, as shown by the cluster- 
ing of these mutants in the k,/k, map (Fig. 2). 

The deletion variants of Cren7 were also assayed for 
their ability to constrain DNA supercoils. All three mutants 
were able to constrain negative DNA supercoils (Fig. 3). As 
expected, LD was 32-fold less efficient than the wild-type 
Cren7 in constraining DNA supercoils. By contrast, the 
ability of GR or GK to constrain DNA supercoils was about 
fourfold lower than that of Cren7. It should be noticed that 
the difference in the ability to constrain negative supercoils 
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between wild-type Cren7 and the three variants was slightly 
underestimated since the comparison was made at the 
same protein/DNA mass ratio and there was a small differ- 
ence (~10 %) in molecular mass between wild-type Cren7 
(~6.5 kDa) and the three variants (~6 kDa), which corre- 
sponded to the protein/DNA molar ratios of 436:1 for wild- 
type Cren7 and 472:1 for the deletion variants, at the mass 
ration of 1:1. In addition, the efficiencies of GK and GR in 
constraining DNA supercoils were similar despite their dif- 
ference in DNA-binding affinity. Interestingly, both GR and 
GK constrained supercoils with slightly higher efficiencies 
than the double substitution mutants, although the former 
had lower affinity for DNA than the latter (Table 2). It is 
speculated that replacement of the long loop, which cov- 
ers 2 bp of DNA (Zhang et al. 2010), with a short hinge in 
Cren7 would reduce the binding size of the protein, allow 
increased binding density and thus enhance the efficiency 


A 


GTGATCAC 


Fig. 5 Crystal structure of the GR-d(GTGATCAC), complex. a 
Ribbon diagram of the GR-GTGATCAC complex. b Superposition 
of Cren7 and GR in complex with GTGATCAC, fitted by common 
protein main-chain atoms. c The DNA-binding surfaces of Cren7 and 
GR, with selected side chains labeled. The loop 3-84 regions of the 
two proteins are highlighted, and the residues of Cren7 and GR are 
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of the protein to constrain DNA supercoils. Taken together, 
our results show that replacement of loop 3-84 with a 
short hinge lacking a positively charged amino acid residue 
significantly impaired the ability of Cren7 to bind DNA and 
to constrain DNA supercoils, and this ability of the protein 
was partially restored if the hinge region contained a posi- 
tively charged residue. 


Crystal structure of the GR-DNA complex 


To determine the structural basis of the effect of replace- 
ment of loop B3-B4 with a short hinge on DNA binding 
and supercoil constraining by the protein, we attempted 
to co-crystallize GR with one of the three 8-bp double- 
stranded DNA fragments (supplementary table S1). The 
crystal structure of GR-d(GTGATCAC), was solved and 
refined to 1.8 A (Table 3; Fig. 5a). A single protein in 


GTGATCAC 


— Cren7-GTGATCAC 
— GR-GTGATCAC 
— Sac7d-GT AATTAC 


labeled in pink and blue, respectively. d Superposition of the Cren7— 
GTGATCAC (pink), GR-GTGATCAC (blue) and Sac7d—GTAAT- 
TAC (yellow; PDB code: 1AZQ) complexes fitted by all atoms from 
one of the DNA strands. e A detailed view of the DNA structures in 
d. The T2pA3 step is marked, and loop 3-84 and Lys31 of Cren7 
are also shown 
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complex with DNA is found in the asymmetric unit. The 
crystal structure of wild-type Cren7 in complex with the 
same DNA duplex was also resolved and refined to 2.3 A 
(Table 3; Fig. 5a). Two Cren7-GTGATCAC complexes are 
in the asymmetric unit and the root-mean-square deviation 
(rmsd) value between them is 0.016 A. 

Superposition of GR with wild-type Cren7 in complex 
with DNA, fitted by their a-carbons with a root-mean- 
square distance of 0.326 A, shows a similar overall struc- 
ture with conformational differences in the regions of 
N-terminal five residues and loop 63-64 (Fig. 5b). The 
GR-DNA complex also shares an overall structure with 
the Cren7—-DNA complex. GR binds DNA as monomers 
in the minor groove. The bound protein induces a sharp 
kink (~48°) in the DNA duplex toward the major groove 
by intercalation of the side chain of Leu28 into the G3A4 
step. The minor groove at the intercalating site is drasti- 
cally widened to an A-form-like conformation. Remark- 
ably, deletion of the long flexible loop leads to a reduc- 
tion in binding size for GR (~6 bp), as compared to that 
for wild-type Cren7 (~8 bp) (supplementary figure S2). 
The short loop between B3 and £4 strands in GR remains 
fixed in the minor groove, but Arg30 on the loop does 
not interact with DNA, as evidenced by the poor electric 
density of the side chain of Arg30 in the crystal lattice. 
Structural comparison between GR and Sac7d in com- 
plex with DNA shows similar conformations in the region 
between £3 and {4 strands (Fig. 5d). All DNA-interacting 
residues are located on the triple-stranded B-sheet in GR. 
GR is similar to wild-type Cren7 in DNA-binding surface 
except for the conformational changes in Lys24 and Val31 
side chains (Fig. 5c). Lys24 in GR contacts the phosphate 
group of C6 and the deoxyribose of T5 through hydropho- 
bic interactions instead of forming hydrogen bond to the 
O1P of C6 through its NH, group, as found in the Cren7— 
DNA complex. Val31 of GR is still in close contact with 
the Cl4pA15 step of DNA, as in the Cren7-DNA com- 
plex. The side chain of Val31, however, is rotated by ~90° 
clockwise and its hydrophobic interaction with the deoxy- 
ribose of C14 no longer exists. 
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Deformation of DNA by bound GR 


In the crystal structure, the DNA bound by GR is similar 
to that bound by wild-type Cren7 in global conformation 
except that the backbone in the T2pG3 region in the GR- 
DNA complex is not as deformed as in the wide-type pro- 
tein-—DNA complex (Fig. 5d, e). In fact, the local conforma- 
tion of this region in the DNA bound by GR resembles that 
in the Sac7d—GTAATTAC complex. The phosphate group 
between T2 and G3 in the DNA bound by wild-type Cren7 
is translocated as a result of the formation of a hydrogen 
bond to Lys31, resulting in distortion in the backbone. 
By comparison, the DNA backbone in the GR-DNA and 
Sac7d—DNA complexes is much smoother due to the lack 
of direct contacts between the loop and the backbone phos- 
phate group. 

As observed in the complex of wild-type Cren7 with 
DNA, binding of GR dramatically widens the minor groove 
to about 12.0 A at the site of intercalation, changing the 
local DNA conformation from B-form (6.0 A) to A-form 
(11.1 Å) (Arnott 1999). The major groove at this site is 
narrowed by about 2.6 A to 9.0 A, as compared to 11.6 A 
in B-form DNA. In comparison, the width of the major 
groove at the intercalation site in the wild-type Cren7—- 
GTGATCAC complex (12.0 A) remains unchanged. Nota- 
bly, GR alters minor and major grooves in the same manner 
as Sac7d, which widens the minor groove to ~12.9 A and 
narrows the major groove to ~8.1 A in the protein-GTAAT- 
TAC complex (Robinson et al. 1998). 

A sharp kink occurs at the G3pA4 step in both the GR- 
DNA and the wild-type Cren7—DNA complexes. The kink 
angle of ~48° in the GR-DNA complex is even larger than 
that in the wild-type Cren7-DNA complex (~45°). How- 
ever, the average roll angle of the DNA duplex bound by 
GR (~8.5°) is smaller than that bound by wild-type Cren7 
(~9.9°), indicating that the bound DNA is overall less 
curved in the GR-DNA complex than in the wild-type 
protein-DNA complex (Table 4). The reduction in DNA 
curvature in the GR-DNA complex is mainly due to nega- 
tive rolls at the G1pT2 and C6pA7 steps (about —4.0° and 


Table 4 DNA helical 


Step Cren7-GTGATCAC GR-GTGATCAC 

parameters of the Cren7— 

GTGATCAC and GR- Rise (A) Roll (°) Twist (°) Rise (A) Roll (°) Twist (°) 

GTGATCAC complexes 
GT/AC 3.14 2.21 29.97 3.43 —4.03 30.14 
TG/CA 2.93 8.56 27.61 2.83 7.94 30.45 
GA/TC 6.21 45.38 20.53 6.24 48.43 19.47 
AT/AT 3.12 6.06 23.09 3.22 4.93 26.01 
TC/GA 3.22 13.57 23.01 3.26 12.17 26.10 
CA/TG 3.30 —4.67 48.96 3.21 —10.37 46.24 
AC/GT 3.21 —1.71 30.04 3.37 0.49 31.76 
Average 3.59 9.91 29.03 3.65 8.51 30.02 
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—10.4°, respectively), since a positive roll of 2.2° and a 
slightly negative roll of about —4.7° are observed at the 
corresponding steps in the wild-type Cren7-DNA com- 
plex. The change in negative roll at the GlpT2 step in GR- 
DNA seems to have resulted from the loss of the contacts 
between GR and the G1:C16 pair of the DNA duplex (sup- 
plementary figure S2). On the other hand, the conforma- 
tional change of Lys24 side chain, which interacts directly 
with the phosphate group of C6, may be responsible for the 
increase in negative roll at the C6pA7 step in the GR-DNA 
complex (Fig. 5C; supplementary figure S2). 

Binding of GR also induces undertwisting of the DNA 
helix, which is believed to result in constraining of DNA- 
negative supercoils (Table 4). The base pairs at the site of 
intercalation show the smallest twist (~19.5°). The average 
twist for a single step is ~30.0° in the GR-GTGATCAC 
complex, which translates into ~12 bp per helical turn in 
bound DNA. Given the binding size of ~6 bp, binding of 
~14 GR molecules would constrain one negative supercoil. 
For the wild-type Cren7-GTGATCAC complex, the aver- 
age twist per base pair is ~29.0°, a number similar to that in 
the GR-GTGATCAC complex. However, due to its larger 
binding size (~8 bp), wild-type Cren7 is calculated to con- 
strain one negative supercoil for ~9 proteins. Therefore, the 
reduced efficiency of GR in constraining DNA supercoils, 
as compared to that of wild-type Cren7, appears to result 
from a decrease in both binding size and binding affinity of 
the protein. 


Discussion 


Cren7 and Sul7d, two chromatin proteins of similar bio- 
chemical properties and tertiary structures, are both pre- 
sent in abundance in Sulfolobus. Investigation of the dif- 
ferences between the two proteins is of importance to the 
understanding of their respective roles in chromosomal 
organization and other DNA transactions in vivo. Structural 
comparison between Cren7 and Sac7d reveals that the most 
distinguishing feature of the two proteins in DNA-binding 
surface is the size of the amino acid stretch between 63 
and £4 strands (Zhang et al. 2010). Notably, loop 63-64 
is highly conserved among Cren7 homologs (Fig. 1). In 
the present study, we show that loop B3-f4 serves a criti- 
cal role in the structural integrity of and DNA binding and 
supercoil constraining by Cren7. Our results provide clues 
to the differences between Cren7 and Sac7d in biochemical 
properties. 

All amino acid residues on loop 63-64 that have a 
relatively long side chain, i.e., Pro30, Lys31, Arg33 and 
Lys34, contribute significantly to DNA binding by Cren7 
(Table 1). Alanine substitution of Pro30 destabilized the 
Cren7—DNA complex, while mutation of the basic amino 
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acid residues affected primarily the binding step of the 
Cren7-DNA interaction. Interestingly, the positively 
charged side chains of Lys31 and Lys34 are not in direct 
contact with the DNA in crystals (Zhang et al. 2010). So 
the finding that their replacement with an uncharged ala- 
nine reduced the DNA-binding affinity of the protein sug- 
gests that the positive charges on the surface of loop B3-B4 
play a role in the interaction between Cren7 and DNA in 
solution. Further, nick-closure assays showed that the effi- 
ciencies of Cren7 mutants in constraining DNA supercoils 
were related proportionally to the number of basic resi- 
dues on the loop (Fig. 3). The decrease in the ability of the 
Cren7 mutants in constraining DNA supercoils may not 
be attributed solely to the decrease in the affinity of these 
mutants for the DNA, since P30A, which showed a much 
reduced affinity for DNA as compared to wild-type Cren7, 
constrained DNA supercoils as efficiently as the wild-type 
protein. A possibility exists that the positively charged resi- 
dues on loop 63-4 are primarily involved in the deforma- 
tion of DNA by Cren7. It is noticed that the Lys31 residue 
undergoes reversible methylation (Guo et al. 2008). Lysine 
methylation in eukaryotic histones has been shown to play 
a key role in the regulation of chromosomal structure and 
gene expression (Grant 2001; Hansen et al. 2006). A simi- 
lar regulatory role may also be proposed for Lys31, which 
distorts the DNA backbone in the Cren7-DNA crystals 
through the formation of hydrogen bond to the phosphate 
group of the DNA. 

The two highly conserved glycine residues on loop B3- 
B4 (Gly32 and Gly35) appear to be critical to the proper 
folding of Cren7. Mutation of either of the two residues led 
to dimerization of the protein (Fig. 4). The structural basis 
for the dimerization of the Cren7 mutants is unclear. It has 
been reported that mutation of a glycine residue (Gly55) 
promotes the conversion of the IgG-binding domain of pro- 
tein L from a monomeric form into a dimeric form at a pro- 
tein concentration of above 30 uM (Kuhlman et al. 2001). 
The 62-residue domain consists of an a-helix packed on a 
four-stranded B-sheet formed by two ß-hairpins. Alanine 
substitution of Gly55, located in the C-terminal hairpin, 
straightens the four-residue turn in this hairpin, allow- 
ing the C-terminal f strand to extend out and insert into a 
partner protomer to form a domain-swapped dimer. Con- 
ceivably, the mutant Cren7 proteins may form dimers in a 
similar manner to the IgG-binding domain of protein L. It 
is possible that, once the glycine residue is mutated, loop 
B3-B4 becomes less flexible, and the straightened loop per- 
mits the 64 and 65 strands to flip out. The two $ strands 
then form an intact 6 barrel with the B1, 82 and 63 strands 
from another Cren7 mutant protomer. Dimerization of the 
mutant Cren7 protein would presumably be favored in 
vivo, given the millimolar levels of the protein present in 
Sulfolobus. Therefore, the two glycine residues appear to 
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be essential in maintaining the flexibility of loop B3-B4 and 
the monomeric form of Cren7 in vivo. 

The presence of a much longer loop between 63 and B4 
strands in Cren7 than that in Sul7d raises a question about 
the role of the size of the loop on the properties of the pro- 
teins. Deletion of five residues in the middle of loop 83-64 
resulted in drastic loss in both DNA-binding and supercoil- 
constraining abilities of Cren7. The loss of function of the 
shortened loop was not effectively compensated even when 
the loop was a small two-residue hinge (i.e., Gly29—Arg30 
for GR or Gly29-Lys30 for GK), which mimicked that in 
Sac7d (Figs. 2, 3). Comparison of the crystal structures of 
wild-type Cren7 and GR in complex with dsDNA reveals 
the structural basis for the possible functional differences 
between the long loop and the small hinge. First, GR occu- 
pies a binding site (~6 bp) smaller than that of wild-type 
Cren7 (~8 bp). Second, no base pair contacts by GR were 
observed in the region from G1-C16 to G3-C14, and thus 
the DNA strand appears much smoother than that bound by 
the wild-type protein. Third, DNA is less bent in the GR- 
DNA complex (~59.6°) than in the wild-type Cren7-DNA 
complex (~69.4°). Given the potential effect of DNA bend- 
ing on DNA compaction, possession of the long loop is 
important for Cren7 to play a role in chromosomal DNA 
organization. Fourth, GR but not Cren7 narrows the major 
groove at the intercalation site. In this regard, GR shows 
greater resemblance to Sul7d than to Cren7. In fact, as com- 
pared to Cren7, both GR and Sul7d possess smaller DNA- 
binding sizes, and bend DNA without severely distorting 
the backbone. An exception is that the DNA curvature 
induced by GR is smaller than that by Sul7d. We infer from 
these results that the size of loop 83-84 may influence the 
width of the major groove of dsDNA bound by the chroma- 
tin protein and, therefore, the chromosomal accessibility of 
other DNA-binding proteins that function in various DNA- 
associated activities. The distinct difference in the impact 
between Cren7 and Sul7d on the major groove of DNA is 
consistent with non-redundant functions of the two chro- 
matin proteins in DNA packaging and transactions. 
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